Insights into Analogy Completion from the Biomedical Domain

نویسندگان

  • Denis Newman-Griffis
  • Albert M. Lai
  • Eric Fosler-Lussier
چکیده

Analogy completion has been a popular task in recent years for evaluating the semantic properties of word embeddings, but the standard methodology makes a number of assumptions about analogies that do not always hold, either in recent benchmark datasets or when expanding into other domains. Through an analysis of analogies in the biomedical domain, we identify three assumptions: that of a Single Answer for any given analogy, that the pairs involved describe the Same Relationship, and that each pair is Informative with respect to the other. We propose modifying the standard methodology to relax these assumptions by allowing for multiple correct answers, reporting MAP and MRR in addition to accuracy, and using multiple example pairs. We further present BMASS, a novel dataset for evaluating linguistic regularities in biomedical embeddings, and demonstrate that the relationships described in the dataset pose significant semantic challenges to current word embedding methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Optimization of Heat Transfer Enhancement of a Domestic Gas Burner Based on Pareto Genetic Algorithm: Experimental and Numerical Approach

The present study attempts to improve heat transfer efficiency of a domestic gas burner by enhancing heat transfer from flue gases. Heat transfer can be augmented using the obstacles that are inserted into the flow field near the heated wall of the domestic gas burner. First, to achive the maximum efficiency, the insert geometry is optimized by the multi-objective genetic algorithm so that heat...

متن کامل

Aerodynamic Noise Computation of the Flow Field around NACA 0012 Airfoil Using Large Eddy Simulation and Acoustic Analogy

The current study presents the results of the aerodynamic noise prediction of the flow field around a NACA 0012 airfoil at a chord-based Reynolds number of 100,000 and at 8.4 degree angle of attack. An incompressible Large Eddy Simulation (LES) turbulence model is applied to obtain the instantaneous turbulent flow field. The noise prediction is performed by the Ffowcs Williams and Hawkings (FW-...

متن کامل

Adapting an NER-System for German to the Biomedical Domain

In this paper, we report the adaptation of a named entity recognition (NER) system to the biomedical domain in order to participate in the ”Shared Task Bio-Entity Recognition”. The system is originally developed for German NER that shares characteristics with the biomedical task. To facilitate adaptability, the system is knowledge-poor and utilizes unlabeled data. Investigating the adaptability...

متن کامل

A Methodology for Discovering Structure in Design Databases

Design by analogy, in which designers draw inspiration from cross-domain design solutions, is a promising methodology for product development. This work attempts to leverage the existing design solutions within a repository, combined with an exploration of inherent structural forms that can be discovered based on the content and similarity of that data, in order to gain useful insights into the...

متن کامل

Additional Insights Into Problem Definition and Positioning From Social Science; Comment on “Four Challenges That Global Health Networks Face”

Commenting on a recent editorial in this journal which presented four challenges global health networks will have to tackle to be effective, this essay discusses why this type of analysis is important for global health scholars and practitioners, and why it is worth understanding and critically engaging with the complexities behind these challenges. Focusing on the topics of problem definition ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017